Generally, an analysis in 3ML is performed in 3 steps:
1. Load the data
2. Define the model
3. Perform the likelihood or Bayesian analysis
3ML is built around the concept of plugins. A plugin is used to load a particular type of data, or the data from a particular instrument. There is a plugin for optical data, one for X-ray data, one for Fermi/LAT data and so on. Plugin instances can be added and removed at the loading stage without changing any other stage of the analysis (but of course, you need to rerun all stages to update the results).
First, let's import 3ML:
In [1]:
from threeML import *
Let's start by loading one dataset, which in the 3ML workflow means creating an instance of the appropriate plugin:
In [2]:
# Get some example data
from threeML.io.package_data import get_path_of_data_file
data_path = get_path_of_data_file("datasets/xy_powerlaw.txt")
# Create an instance of the XYLike plugin, which allows us to analyze
# simple x,y points with error bars
xyl = XYLike.from_text_file("xyl", data_path)
# Let's plot it just to see what we have loaded
xyl.plot(x_scale='log', y_scale='log')
Out[2]: [log-log plot of the loaded x,y data points with error bars]
Now we need to create a DataList object, which in this case contains only one instance:
In [3]:
data = DataList(xyl)
The DataList object can receive one or more plugin instances on initialization. So for example, to use two datasets we can simply do:
In [4]:
# Create the second instance, this time of a different type
pha = get_path_of_data_file("datasets/ogip_powerlaw.pha")
bak = get_path_of_data_file("datasets/ogip_powerlaw.bak")
rsp = get_path_of_data_file("datasets/ogip_powerlaw.rsp")
ogip = OGIPLike("ogip", pha, bak, rsp)
# Now use both plugins
data = DataList(xyl, ogip)
The DataList object can accept any number of plugins as input.
You can also create a list of plugins first, and then create a DataList using the "expansion" feature of the Python language ('*'), like this:
In [5]:
# This is equivalent to writing data = DataList(xyl, ogip)
my_plugins = [xyl, ogip]
data = DataList(*my_plugins)
This is useful if you need to create the list of plugins at runtime, for example by looping over many files.
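For instance, here is a minimal sketch of that pattern (the file names are hypothetical placeholders):
# Build the plugin list at runtime by looping over several data files
# (these file names are just placeholders for this sketch)
file_names = ["dataset1.txt", "dataset2.txt", "dataset3.txt"]
my_plugins = []
for i, file_name in enumerate(file_names):
    # Each plugin instance needs a unique name
    my_plugins.append(XYLike.from_text_file("xyl_%i" % i, file_name))
data = DataList(*my_plugins)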
After you have loaded your data, you need to define a model for them. A model is a collection of one or more sources. A source represents an astrophysical reality, like a star, a galaxy, a molecular cloud and so on. There are 3 kinds of sources: PointSource, ExtendedSource and ParticleSource. The latter is used only in special situations. Models are defined using the astromodels package. Here we will only go through the basics; you can find much more information here: astromodels.readthedocs.org
A point source is characterized by a name, a position, and a spectrum. These are some examples:
In [6]:
# A point source with a power law spectrum
source1_sp = Powerlaw()
source1 = PointSource("source1", ra=23.5, dec=-22.7, spectral_shape=source1_sp)
# Another source with a log-parabolic spectrum plus a power law
source2_sp = Log_parabola() + Powerlaw()
source2 = PointSource("source2", ra=30.5, dec=-27.1, spectral_shape=source2_sp)
# A third source defined in terms of its Galactic latitude and longitude
source3_sp = Cutoff_powerlaw()
source3 = PointSource("source3", l=216.1, b=-74.56, spectral_shape=source3_sp)
In [7]:
# An extended source with a Gaussian shape centered on R.A., Dec = (30.5, -27.1)
# and a sigma of 3.0 degrees
ext1_spatial = Gaussian_on_sphere(lon0=30.5, lat0=-27.1, sigma=3.0)
ext1_spectral = Powerlaw()
ext1 = ExtendedSource("ext1", ext1_spatial, ext1_spectral)
# An extended source with a 3D function
# (i.e., the function defines both the spatial and the spectral shape)
ext2_spatial = Continuous_injection_diffusion()
ext2 = ExtendedSource("ext2", ext2_spatial)
NOTE: not all plugins support extended sources. For example, the XYLike plugin we used above does not, as it is meant for data without spatial resolution.
Now that we have defined our sources, we can create a model simply as:
In [8]:
model = Model(source1, source2, source3, ext1, ext2)
# We can see a summary of the model like this:
model.display()
You can easily interact with the model. For example:
In [9]:
# Fix a parameter
model.source1.spectrum.main.Powerlaw.K.fix = True
# or
model.source1.spectrum.main.Powerlaw.K.free = False
# Free it again
model.source1.spectrum.main.Powerlaw.K.free = True
# or
model.source1.spectrum.main.Powerlaw.K.fix = False
# Change the value
model.source1.spectrum.main.Powerlaw.K = 2.3
# or using physical units (these need to be compatible with what is
# shown in the table above)
model.source1.spectrum.main.Powerlaw.K = 2.3 * 1 / (u.cm**2 * u.s * u.TeV)
# Change the boundaries for the parameter
model.source1.spectrum.main.Powerlaw.K.bounds = (1e-10, 1.0)
# you can use units here as well, like:
model.source1.spectrum.main.Powerlaw.K.bounds = (1e-5 * 1 / (u.cm**2 * u.s * u.TeV),
                                                 10.0 * 1 / (u.cm**2 * u.s * u.TeV))
# Link two parameters so that they are forced to have the same value
model.link(model.source2.spectrum.main.composite.K_1,
           model.source1.spectrum.main.Powerlaw.K)
# Link two parameters with a law. The parameters of the law become free
# parameters in the fit. In this case we impose a linear relationship
# between the index of the log-parabolic spectrum and the index of the
# powerlaw in source2: index_2 = a * alpha_1 + b.
law = Line()
model.link(model.source2.spectrum.main.composite.index_2,
           model.source2.spectrum.main.composite.alpha_1,
           law)
# If you want to force them to be in a specific relationship,
# say index_2 = alpha_1 + 1, just fix a and b to the corresponding values
# (here a = 1 and b = 1, given index_2 = a * alpha_1 + b) after the linking:
# model.source2.spectrum.main.composite.index_2.Line.a = 1.0
# model.source2.spectrum.main.composite.index_2.Line.a.fix = True
# model.source2.spectrum.main.composite.index_2.Line.b = 1.0
# model.source2.spectrum.main.composite.index_2.Line.b.fix = True
# Now display() will show the links
model.display()
Now, for the following steps, let's keep it simple and use a single point source:
In [10]:
new_model = Model(source1)
source1_sp.K.bounds = (0.01, 100)
A model can be saved to disk, and reloaded from disk, as:
In [11]:
new_model.save("new_model.yml", overwrite=True)
new_model_reloaded = load_model("new_model.yml")
The output is in YAML, a human-readable text-based format.
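Since it is just text, you can inspect the saved definition with plain Python (this uses standard file I/O, nothing 3ML-specific):
# Print the model definition that was just saved
with open("new_model.yml") as f:
    print(f.read())
Now that we have both the data and the model, we can perform a maximum likelihood fit: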
In [12]:
data = DataList(ogip)
jl = JointLikelihood(new_model, data)
best_fit_parameters, likelihood_values = jl.fit()
The output of the fit() method of the JointLikelihood object consists of two pandas DataFrame objects, which can be queried, saved to disk, reloaded and so on. Refer to the pandas manual for details.
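For instance, a minimal sketch of saving and reloading one of them with standard pandas calls (the CSV file name is arbitrary):
import pandas as pd
# Save the best-fit parameters table to disk
best_fit_parameters.to_csv("best_fit_parameters.csv")
# Reload it later, possibly in another session
best_fit_parameters_reloaded = pd.read_csv("best_fit_parameters.csv", index_col=0)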
After the fit the JointLikelihood instance will have a .results attribute which contains the results of the fit.
In [13]:
jl.results.display()
This object can be saved to disk in a FITS file:
In [14]:
jl.results.write_to("my_results.fits", overwrite=True)
The produced FITS file contains the complete definition of the model and of the results, so it can be reloaded in a separate session as:
In [15]:
results_reloaded = load_analysis_results("my_results.fits")
results_reloaded.display()
The flux of the source can be computed from the 'results' object (even in another session, by reloading the FITS file), as:
In [16]:
fluxes = jl.results.get_point_source_flux(100 * u.keV, 1 * u.MeV)
# Same results would be obtained with
# fluxes = results_reloaded.get_point_source_flux(100 * u.keV, 1 * u.MeV)
We can also plot the spectrum with its error region, as:
In [17]:
plot_point_source_spectra(jl.results, ene_min=0.1, ene_max=1e6, num_ene=500,
                          flux_unit='erg / (cm2 s)')
Out[17]: [plot of the best-fit spectrum with its error region]
In a very similar way we can perform a Bayesian analysis. First, we need to define priors for the free parameters:
In [18]:
# A uniform prior can be defined directly, like:
new_model.source1.spectrum.main.Powerlaw.index.prior = Uniform_prior(lower_bound=-10,
                                                                     upper_bound=10)
# or it can be set using the currently defined boundaries
new_model.source1.spectrum.main.Powerlaw.index.set_uninformative_prior(Uniform_prior)
# The same for the Log_uniform prior
new_model.source1.spectrum.main.Powerlaw.K.prior = Log_uniform_prior(lower_bound=1e-3,
                                                                     upper_bound=100)
# or
new_model.source1.spectrum.main.Powerlaw.K.set_uninformative_prior(Log_uniform_prior)
new_model.display()
Then, we can perform our Bayesian analysis like this:
In [19]:
bs = BayesianAnalysis(new_model, data)
# This uses the emcee sampler
samples = bs.sample(n_walkers=30, burn_in=100, n_samples=1000)
The BayesianAnalysis object will now have a "results" member which works exactly as explained above for the maximum likelihood analysis:
In [20]:
bs.results.display()
In [21]:
fluxes_bs = bs.results.get_point_source_flux(100 * u.keV, 1 * u.MeV)
In [22]:
plot_point_source_spectra(bs.results, ene_min=0.1, ene_max=1e6, num_ene=500,
                          flux_unit='erg / (cm2 s)')
Out[22]: [plot of the spectrum with its error region from the Bayesian analysis]
We can also easily produce a "corner plot", like:
In [23]:
bs.corner_plot()
Out[23]: [corner plot of the posterior samples]